# Bidirectional Text-Image Generation
Harmon 1 5B
Harmon is an innovative unified multimodal understanding and generation framework that coordinates visual representations for understanding and generation through a shared MAR encoder, demonstrating excellent performance in text-to-image generation and multimodal understanding tasks.
Text-to-Image
Safetensors English
H
wusize
281
2
Versatile Diffusion
MIT
The first unified multi-stream multimodal diffusion framework supporting bidirectional image-text conversion and editing
Text-to-Image
V
shi-labs
8,455
48
Featured Recommended AI Models